Picture for Mahdi Karami

Mahdi Karami

Auto-Regressive Masked Diffusion Models

Add code
Jan 23, 2026
Viaarxiv icon

Trellis: Learning to Compress Key-Value Memory in Attention Models

Add code
Dec 29, 2025
Viaarxiv icon

MS-SSM: A Multi-Scale State Space Model for Efficient Sequence Modeling

Add code
Dec 29, 2025
Viaarxiv icon

TNT: Improving Chunkwise Training for Test-Time Memorization

Add code
Nov 10, 2025
Viaarxiv icon

Lattice: Learning to Efficiently Compress the Memory

Add code
Apr 08, 2025
Viaarxiv icon

TRecViT: A Recurrent Video Transformer

Add code
Dec 18, 2024
Viaarxiv icon

Best of Both Worlds: Advantages of Hybrid Graph Sequence Models

Add code
Nov 23, 2024
Viaarxiv icon

Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling

Add code
Feb 28, 2024
Figure 1 for Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Figure 2 for Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Figure 3 for Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Figure 4 for Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Viaarxiv icon

HiGen: Hierarchical Graph Generative Networks

Add code
May 30, 2023
Viaarxiv icon

HiGeN: Hierarchical Multi-Resolution Graph Generative Networks

Add code
Mar 06, 2023
Figure 1 for HiGeN: Hierarchical Multi-Resolution Graph Generative Networks
Figure 2 for HiGeN: Hierarchical Multi-Resolution Graph Generative Networks
Figure 3 for HiGeN: Hierarchical Multi-Resolution Graph Generative Networks
Figure 4 for HiGeN: Hierarchical Multi-Resolution Graph Generative Networks
Viaarxiv icon